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Abstract 

We prove that the distribution function of the largest eigenvalue in the Gaussian Unitary 
Ensemble (GUE) in the edge scaling limit is expressible in terms of Painleve II. Our goal is 
to concentrate on this important example of the connection between random matrix theory 
and integrable systems, and in so doing to introduce the newcomer to the subject as a whole. 
We also give sketches of the results for the limiting distribution of the largest eigenvalue in 
the Gaussian Orthogonal Ensemble (GOE) and the Gaussian Symplectic Ensemble (GSE). 
This work we did some years ago in a more general setting. These notes, therefore, are not 
meant for experts in the field. 

1 Introduction 



For the finite N random matrix models GOE, GUE and GSE, the probability that no 
eigenvalues lie in a set J, Ep(0; J), equals a Frcdholm determinant of an integral operator 
with scalar kernel for GUE ((3 — 2) and equals the square root of a Fredholm determinant 
of an integral operator with matrix kernel for GOE (/? = 1) and GSE (/3 = 4). Gaudin jlTj 
(using Mehta's newly invented method of orthogonal polynomials p^| ) first discoverd this 
connection between random matrix theory and Fredholm determinants, for GUE in the bulk 
scaling limit, in 1961. In 1970 Dyson |ll| showed that the n-point correlations for circular 
ensembles are expressible as quaternion determinants. The next year Mehta | j34| extended 
Dyson's formalism to include the GOE and the GSE. From these representations of the n- 
point correlations, one deduces that Ep(0; J) 2 (/? = 1,4) are Fredholm determinants. Later 
several authors generalized this construction to other invariant ensembles. Sec |4(], [[l| for 
simplified proofs of these facts. 
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2 Distribution of Largest Eigenvalue, [3 = 2 



2.1 Fredholm Determinant Representation 

In finite N GUE (see, e.g., f| |§) 

E N>2 (0; J) = det 
where if jv,2 is the integral operator 

(K N ,2f) ( x ) = J ^ K Nf2 {x, y)f{y) dy, 



K N , 2 {x,y) = , (1 

/iV\ 1/4 

¥>(» = I y I ¥>Jv(a;), (2) 



^0*0 = (^yj <Pn-i(?) ( 3 ) 
and the ifk(x) are the harmonic oscillator wave functions 

Vfc(as) ^ 7= * e x2 ' 2 H k (x). 

(The Hk{x) are the Hermite polynomials and the sequence {<yS/c}fc>o forms a complete 
orthonormal sequence in L 2 (R).) 

For J = (t, oo), the probability that no eigenvalues lies in J is the same as the proba- 
bility that all eigenvalues lie to the left of t; and in particular, that the largest eigenvalue, 
Amax(^V), lies to the left of t: 

F N , 2 {t) ■= Prob (Amax(A0 < t) = E 2 (0; (t, oo)) . 

(Since the distribution functions all have continuous densities, we need not worry about 
the difference between < and <„) 



2.2 Edge Scaling Limit 

Fixing t, Prob (\m&x(N) < t) — > as N — > oo since the largest eigenvalue increases with 
N; and hence, the probability that it remains less than any hxed number tends to zero. To 
obtain, therefore, a nontrivial limiting distribution we must introduce a normalized random 
variable. Such situations are common in probability theory, e.g. the central limit theorem. 
The density of eigenvalues at x, p(x), equals K^ t2 (x,x). Replacing Kjsr t2 (x,y) by 

W) K ^( z+ wr z+ W)) 

has the effect of making the point z in the spectrum the new origin and rescaling so that the 
eigenvalue density at this point becomes equal to 1. The "edge of the spectrum" corresponds 
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to z ~ ±\2N. (The constant V2 results from the choice of the standard deviation of the 
matrix elements in GUE — here we choose a standard deviation equal to l/v2 which is the 
same as in Mehta |35|.) The edge scaling limit jl0|, |l6) for the kernel is then 

jfe, ^< 2 + 2V^' ^ + 2V^) = W*' V) (4) 

where ^Airy nas ^ ne f° rm (0) with 

<p(x) — Ai(a;), ip(x) = Ai'(x), 
and Ai(x) the Airy function. Given t and N, we define s by 

f = Vm 



2 l/2 N l/6- 



Using (^) and the Fredholm expansion of det(I — Kn .2), it follows as N — > 00 and t — > 00 
such that s is fixed that (the edge scaling limit) 



lim Fn,2 (t) = F 2 (s) = det (i - K A 



rry 

where A^j r y acts on L 2 ((s, 00)). 

Remarks: Inspecting (0) and (^) we see that one needs asymptotic formulas for the 
Hermite polynomials Hk(x) for large values of both x and k. Fortunately, such formulas 
are known and were derived by Plancherel and Rotach in 1929. (They can be found in 
the Batcman Manuscripts.) Using these asymptotic formulas it is then straighforward to 
derive ([|). Recent progress has been made in deriving analogous Plancherel- Rotach type 
formulas for orthogonal polynomials whose weig ht function is of the form e - y §, Thc 
proofs are considerably more difficult than in the classical Hermite case. These formulas 
then allow one to prove that the Airy kernel arises in the edge scaling limit under much 
more general circumstances. 

2.3 Painleve II Representation for F 2 

For notational convenience we denote -ft^Airy by K in this subsection. It will also be 
convenient to think of our operator K as acting, not on (s, 00), but on R and to have 
kernel 

K{x,y)x{y) 

where x is the characteristic function of J. Since the integral operator K is trace-class and 
depends smoothly on the parameter s, we have the well known formula 

^logdet(/-^) = -tr(^(/-^)- 1 ^j. (5) 

By calculus 

^ = -K(x,s)S(y-s). (6) 

(If L is an operator with kernel L(x,y) we denote this by L = L(x,y).) Substituting this 
into the above expression gives 

4- log det (I-K) = -R(s, s) 
as 



3 



where R(x,y) is the resolvent kernel of K, i.e. R = (I — K)~ 1 K = R(x,y). The resolvent 
kernel R(x,y) is smooth in x but discontinuous in y at y — s. The quantity R(s,s) is 
interpreted to mean 

lim R(s,y)- 
2.3.1 Representation for R(x, y) 

If M denotes the multiplication operator, (Mf)(x) = xf(x), then 

[M, K] = xK(x, y) - K(x, y)y = (x - y)K{x, y) = cp(x)ip(y) - ip(x)(p(y). 
As an operator equation this is 

[M, K] = ip®ip-%l)®ip. 
(We define a <g> b = a(x)b(y) and let [■, •] denote the commutator.) Thus 



M, (7 -if)" 1 



= (I - K) 1 [M, K] {I - K) 1 

= (/-irr 1 {v® i>- ip® <p) (i-Ky 1 

= Q®P-P®Q 



where we have introduced 



Q{x; s) = Q{x) = {I - K)' 1 ip and P(x; s) = P(x) = {I - K)~ l ip. 
On the other hand since (I — K)^ 1 = p(x. y) = 5{x — y) + R(x, y), 

M, (I - KY 1 ^ = (x - y)p{x, y) = (x- y)R(x, y). 

Comparing (fy and (||) we see that 



x-y 



Taking y — > a; gives 
Introducing 
we have 



E(a;,x) = Q'(x)F(.t) - P'(a:)Q(a:). 
<?(s) = Q(s; s) and p(s) = P(s; s), 



(7) 
(8) 

(9) 

(10) 

(11) 
(12) 

(13) 



R(s, s) = Q'(s; s)p(s) — P'(s; s)q(s), s < x,y < oo. 

The reader may have noted that the expression jlO| ) for i?(x, y) depends only upon the 
kernel K(x,y) having the form ([!]). As far as the authors are aware, the generality of this 
representation for R(x,y) was first stressed by Its, et al. ^3|, though it appears in the 
context of the sine kernel in the work of Jimbo, et al. |26|. 
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2.3.2 Formulas for Q'[x) and P'(x) 

As we just saw, we need expressions for Q'(x) and P'(x). If D denotes the differentiation 
operator, d/dx, then 



y(x;s) = D(I-K)-\ 

= (i - Ky 1 Dip + \d, (I — K)~ 
= (i - xy 1 -0 + \d, (i - xy 1 
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P(x)+ D,(I-KY 



cp. 



(14) 



We need the commutator 

D,(I-K) 
Integration by parts shows 

[D,K] = 



= {I-K)- 1 [D,K] (I-K)-\ 



dK dK 
dx dy 



+ K{x, s)5{y — s). 



(The 8 function comes from differentiating the characteristic function Using the specific 
form for tp and "0 {<p' = ip, ip' = xp) we compute: 

dK 8K\ 

ttt + it = <p{ x my)- 



dx dy 



Thus 



-Q(x)Q(y) + R(x, s)p(s, y). 



D,[I-K)- X 

(Recall (/ — Ky 1 = p(x,y).) We now use this in @ 

Q'(x;s) = P(x)-Q{x)(Q,p) + R(x,s)q(s) 
= P(x) - Q(x)u(s) + R(x,s)q(s) 

where the inner product (Q,p) is denoted by u(s). Evaluating at x — s gives 

Q'(s; s) = p(s) - q(s)u(s) + R(s, s)q(s). 



(15) 



(16) 



We now apply the same procedure to compute P' encountering the one new feature that 
since ip'(x) = xtp(x) we need to introduce an additional commutator term: 

P'(x;s) = D(J-Ky x i) 

= (i-Ky 1 Dip+ D^i-Ky 1 v 

= m (i — xy 1 <p+Ui- xy 1 , m <p + \d, (i - xy 1 ] ip 

= xQ(x) + {P®Q-Q®P)p + (-Q (g) Q)ip + R(x, s)p(s) 

= xQ(x) + P{x) (Q, p) - Q(x) (P, p) - Q{x) (Q, 0) + R(x, s)p(s) 

= xQ(x) - 2Q(x)v(s) + P(x)u(s) + R(x, s)p(s). 



5 



Here v = (P, ip) = (i/j,Q). Evaluating at x = s gives 

P'(s;s) = sq(s) + 2q(s)v(s) + p(s)u(s) + R(s , s)p(s) . 



Using this and the expression for Q'(s; s) in (13) gives 

R(s, s)=p 2 - sq 2 + 2q 2 v - 2pqu. (17) 

2.3.3 First order equations for q, p, u and v 

By the chain rule 



ds \ dx ds 

We have already computed the partial of Q(x; s) with respect to x. The partial with respect 
to s is 

= -R{x,s)q(s) 

where we used (|^). Adding the two partial derivatives and evaluating at x = s gives 

dq 

— =p-qu. (19) 
ds 

A similar calculation gives 

^ = sq - 2qv + pu. (20) 
ds 

We derive first order differential equations for u and v by differentiating the inner products: 

/■OO 

u(s) = / Lp(x)Q(x; s) dx, 



du f°° dQi^s) 

- = -<p(s)q(s)+ vW—^—dx 



- <P{ S ) + / R(s,x)ip(x)dx q(s) 



Similarly, 



-(I-K) \(s)q(s) 
dv 



ds 



2.3.4 Painleve II and F 2 



From the first order differential equations for q, u and v it follows immediately that the 
derivative of u 2 — 2v — q 2 is zero. Examining the behavior near s — oo to check that the 
constant of integration is zero then gives 

u 2 — 2v == q 2 , 
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a first integral. We now differentiate ( |l9| ) with respect to s, use the first order differential 
equations for p and u, and then the first integral to deduce that q satisfies the Painleve II 
equation 

q" = sq + 2q 3 . (21) 

Checking the asymptotics of the Fredholm determinant det(J — K) for large s shows we 
want the solution to the Painleve II equation with boundary condition 

q(s) ~ Ai(s) as s — > oo. (22) 

That a solution q exists and is unique follows from the representation of the Fredholm 
determinant in terms of it. Independent proofs of this, as well as the asymptotics as s — > oo 
were given by |^2|, [ll], [lj] . 

Since the kernel of [D, (I - K)- 1 ] is (d/dx + d/dy)R{x, y), (£f) says 

d d\ 

— + — y) = -Q(x)Q(y) + R(x, s)p(s, y). 



^dx dy 

In computing dQ(x; s)/ds we showed that 



§- s (I~K) 1 = -^- s R(x,y) = -R(x,s)p(s,y). 



Adding these two expressions, 



d d d 

«- + «- + ir ) flfo y) = -Q(z)Q(y), 



v 9x (9y 9s 
and then evaluating at x = y = s gives 

±R(s,s) = -q 2 . (23) 
as 

Integration (and recalling (||)) gives, 

d f°° 
— log det (I — K) = — / (x) cte; 

and hence, 

/oo / />oo \ />oo 

I / <7 2 (a;)efo) dy = - I (x - s)q 2 (x) dx. 

To summarize, we have shown that the distribution function F2 has the Painleve repre- 
sentation 

/ r°° 

(x- s)q 2 (x)dx) (24) 



F 2 (s)=exp(-J 



where q satifies the Painleve II equation ( |2l|) subject to the boundary condition (|22|). 
Alternatively, it is easy to check that (p3h integrates to 

R(s,s) = (q') 2 -sq 2 -q\ (25) 



This and together with ( |2l| ) shows that i?(s, s) itself satisfies 

(i?") 2 + 4i?' ((i?') 2 - + R) = 0. 

This last differential equation is the Jimbo-Miwa-Okamoto [g7| |36| er form for Painleve II. 
(Observe that (p5|) and Jl?]) give another identity between q, p, u and v.) 
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2.4 Remarks 



1. In j43| the general case J = (J(a2j_i, a,2j) is first considered and then specialized to 
J = (s, oo). Here we have simply given the proof of ]43fl for this special case from the 
very beginning. 

2. The asymptotics of i*2(s) as s — > — oo are important and they require the solution of 
a connection problem for Painleve II. This connection problem is solved in [ll], |l4| 
and applied in Q to F 2 with the result that 

The constant tq is conjectured to equal ( _1 )2 1 / 24 . 

3. For the distribution of the next-largest, next-next-largest, etc. eigenvalues, see pjjfl . 

4. The connection between certain classes of Fredholm determinants and integrable sys- 
tems of the Painleve type begins with work on the two-dimensional Ising model |53| , 
|32]] . In this model the spin-spin correlation functions in the scaling limit are expressed 
in terms of a third Painleve transcendent. Painleve functions first appear in random 
matrix theory in prjf where the Fredholm determinant of the sine kernel, J = (—t,t), 
is given in terms of a fifth Painleve transcendent. 

5. In a general theory was developed that relates Fredholm determinants of operators 
K with kernel of the form ([!]) acting on J = \J(a,2j-i, o,2j) with integrable systems. 
This theory assumes that the functions <p and ip satisfy a first order linear system of 
differential equations with rational coefficients. For related developments sec ^l], 

MM. 



3 Orthogonal and Symplectic Ensembles 

The probabilities, Ep(0; J), that a set J consisting of a finite union of open intervals contains 
no eigenvalues for the finite N GOE ((3 — 1) and GSE ((3 — 4) are more difficult to compute 
than are the probabilities for the finite GUE ((3 — 2). The essential problem comes down 
to the fact that we must deal with Fredholm determinants of operators with matrix-valued 
kernels 

Changing notation slightly, we denote by S the operator -ftjv,2 of the previous section 
and let <p and ip be as before; namely, (^ and (^), respectively. Let e equal the operator with 
kernel e(x — y) = ^sgn(x — y) (sgn is the sign function) and D the differentiation operator. 
It then follows from Mehta's representation |54j of the n-point correlations [[15], [l6| and for 
N even (this case is slightly simpler) that 

En,i(0; J) 2 = dot (I-K N ,i) 

where 

_ / S + ip®eip SD-ij)®ip \ 
NA ~ X \ eS - e + ei(j ® eip S + eip®^ ) X 

and x is the operator of multiplication by Xj(a;), the characteristic function of J. There is 
a similar representation for £jv,4(0; J) |l5| pi) . This form for -Kjv.i indicates our change of 
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view; namely, we think of Kn,i as a 2 x 2 matrix with operator entries. We then manipulate 
these determinants of these operators to put them in the form 



EnA°\ J f = £w,2(0; J) det ( I - ^ a k <g> (i k 



k=l 



for (3 = 1, 4. (Of course, the ajt's and /3/-'s depend upon the ensemble.) The last determinant 
is evaluated using the general formula Q 

det ^/ - ^ a fe (g) /3 fe ^ = det (S jjk - ("j, Afc)) 3 -,k=i, n 

where (aj,(3 k ) denotes the inner product. 

We then derive differential equations for the inner products introduced by the last 
determinant. For J = (i, oo), Ep(Q;J) is again the distribution function of the largest 
eigenvalue, F]sr^(t), in cither the GOE ((3 = 1) or GSE (j3 = 4). Again we take the edge 
scaling limit: 

Here a is the standard deviation of the Gaussian distribution on the off-diagonal matrix 
elements. Our choice corresponds to a = 1/V2, but as the notation suggests, Fp(s) is 
independent of a. Our final results are 



Fx(s) 2 = F 2 (s) expl-/ q{x)dx), (26) 



F^s/^fsf = F 2 (s)cosh 2 Q^ q{x)dx 
and F 2 and q are as before. 



(27) 



3.1 Remarks 

1. Strictly speaking these distributions are only applicable in the limit that the size of 
the matrices, N, tends to infinity. In practice they are good approximations once 
N > 200. Some comparisons with finite N simulations are in |47j| . 

2. Table 1 displays some statistics of Fp and Figure 1 graphs the densities fp(s) = 
dFp/ds. 

3. As has been established for the edge scaling limit of unitary matrix ensembles |^|, O, 
, we expect the distributions F\ and F4 to be the generic distributions at the edge 

of the spectrum (no "fine tuning" of the potential) for a large class of orthogonal and 
symplectic matrix ensembles. This universality has been proved for the orthogonal 
and symplectic ensembles in the case of a quartic potential [[l2| . In |3(J the distribution 
of the appropriately centered and normalized largest eigenvalue of a p variate Wishart 
distribution on n degrees of freedom is shown to converge, as n — > 00, p — > 00, such 
that n/p is fixed, to F\. 
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Table 1: The mean (up), standard deviation (cxg), skewness (Sp) and kurtosis (Kp) of Fp. 





M/9 




Sp 


Kp 


i 


-1.20653 


1.2680 


0.293 


0.165 


2 


-1.77109 


0.9018 


0.224 


0.093 


4 


-2.30688 


0.7195 


0.166 


0.050 



4. The past few years has seen an explosion of activity connecting random matrix theory 
and combinatorial problems related to the Robinson-Schensted-Knuth algorithm. At 
the level of limit laws, this began with the work of Baik, Deift and Johansson |Q (for a 
review of this work, see |l2| ) who discovered that Fi is the limiting distribution of the 
(appropriately centered and normalized) length of the longest incrcasingsubsequence 
of a random permutation. For related developments see g, [| [§, ||, [7J |§, ^9|, |2(j, Q 
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